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Abstract 

The classical Erdos-Ko-Rado theorem states that if k < [n/2j 
then the largest family of pairwise intersecting fe-subsets of [n] = 
{0, is of size A family of k subsets satisfying this 

pairwise intersecting property is called an EKR family. We gen¬ 
eralize the EKR property and provide asymptotic lower bounds on 
the size of the largest family A of fc-subsets of [n] that satisfies the 
following property: For each A,B,C G A, each of the four sets 
C^; A n B^ n C; A^ D B D C are non-empty. 
This generalized EKR (GEKR) property is motivated, generalizations 
are suggested, and a comparison is made with fixed weight 3-covering 
arrays. Our techniques are probabilistic, and reminiscent of those used 
in [S] and in the work of Roux, as cited in [Hj . 


1 Introduction 

The classical Erdos-Ko-Rado (EKR) theorem (jH]]) states that if k < [n/2J 
then the largest family of pairwise intersecting /c-subsets of [n] = (0,1,..., n} 
is of size (^l]), and is given, moreover, by the class of subsets of [n] containing 
a hxed element a. A family of pairwise intersecting k subsets is called an 
EKR family. We generalize the EKR property and provide asymptotic lower 
bounds on the size of the largest family A of Usubsets of [n] that satishes 
the following property: For each A,B,C G A, each of the four sets An B n 
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C] An B n C^] A n n C-, A^^ n B nc are non-empty, where A‘^ denotes 
the complement of the set A. 

Now why might such a property be of interest? Here is motivation for 
our choice of this somewhat unusual generalized EKR (GEKR) property, 
together with possible extensions. The EKR theorem can be thought of in 
several ways, but one is the following: If each person speaks a different set of 
k languages out of a total of n, what is the largest number of people that can 
have two-way conversations with each other? Or, in a “small world network” 
context, if each person knows a unique set of k others, what is the largest 
number of people possible so that any two have a mutual acquaintance? Now 
imagine that we desire a situation in which any two of any three people can 
have a conversation with no fear of the third eavesdropping, and yet all three 
are able to communicate if necessary. Or one in which any two of any three 
people have a common friend who is a stranger to the third, while there 
also exists a person who is a mutual friend of all three. We seek therefore, 
to construct a family of “partially 3-independent sets” (see, e.g., |2] for the 
dehnition of fc-independence.) Our dehnition can be extended to one in 
which, given any three people, 1 < /9 < 8 specihed regions in the associated 
Venn diagram are non-empty. This situation can clearly be generalized to 
more than three sets, but we choose not to do so. Another possibility might 
be (using the language analogy) the following: Language ability is at three 
levels: ignorant (0); novice (1), and expert (2). Two persons who are at the 
novice and expert level at a particular language can converse at the novice 
level, and thus keep keep their conversation secret from the third, assumed 
to be ignorant. Likewise, two persons at the level ‘2’ can speak rapidly and 
at a high level, thus keeping their conversation undecipherable to a third who 
is at the ‘0’ or ‘1’ level. Three persons can converse if there exists a language 
at which each is at level 1 or above. 

Key to our development is a basic comparison between the GEKR prop¬ 
erty and hxed weight binary 3-covering arrays ( [3] and [S] are comprehensive 
surveys, and both provide an exhaustive list of references on covering arrays). 
Recall that a t-covering array C(n, q, t, A) is dehned as an m x n array satis¬ 
fying the property that for any choice of t rows, each of the g* g-ary t-tuples 
appear at least A times among the columns of the selected rows. We dehne the 
partial t-covering array C{n, q,t,AO {0,1,..., g—1}*, A) to be an mxn array 
satisfying the property that for any choice of t rows, each of the g-ary t-tuples 
in the ensemble A appear at least A times among the columns of the selected 
rows. We thus see that the GEKR property holds for m “people” if and only 


2 


if the m X n “person-language” incidence matrix is a partial C{n,2,3, A,l) 
covering array with A = {(0,1,1) U (1, 0,1) U (1,1, 0) U (1,1,1))}, and with 
each row having weight k. We seek to hnd lower bounds on the maximum 
number of rows m so that such an array exists, and our results are typically 
of the form m > C” for some constant C. Our techniques are probabilistic, 
and reminiscent of those used in [S] and by Roux, the latter as cited in (Hj. 
Connections between the EKR property and t-covering arrays have been ex¬ 
ploited in a different context by researchers such as Karen Meagher, Lucia 
Moura, and Brett Stevens in Ottawa. For example, in [7j, EKR theorems 
for are proved for set partitions. These are connected to strength-2 covering 
arrays with any alphabet rather than strength-3 binary covering arrays. See 
also jB]. 

Our results are presented in the next section. Several other generaliza¬ 
tions of the EKR theorem are studied in H, i, 171, and jH]; the references 
contained in these sources span, between them, almost 45 years of develop¬ 
ments. 


2 Results 

We start by considering a result in which each row of the mxn array is allowed 
to have expected weight k. Specihcally, the strategy is to independently place 
a one in any of the m ■ n places with probability a = k/n and a zero with 
probability 1 — a. A set of three rows is said to be deficient if the rows do 
not contain one of the four vectors (0,1,1), (1, 0,1), (1,1, 0), (1,1,1) among 

their columns. Set X = where Ij equals one or zero according as 

the jth set of three rows is dehcient (or not). It is evident that {X = 0} if 
and only if the array satishes the GEKR property, so that P(W = 0) > 0 <;=> 
the appropriate partial 3-covering array can be constructed. The Lovasz 
local lemma is used extensively throughout the paper - it provides a simple 
condition which guarantees a positive probability for the event that no set of 
three rows is dehcient; see [T] for this result, stated below for convenience: 

Lemma 1 Let Ai, A 2 ,..., Aj^ he events in an arbitrary probability space. 
Suppose that each event Ai is mutually independent of a set of all the other 
events Aj but at most d, and that P(Ai) < p for alll <i < N. If ep{d+l) < 1 
then P(nJli Af) > 0. 


3 


Theorem 2 Consider an m x n array of zeros and ones with eaeh entry 
being independently chosen to he a one with probability a and a zero with 
probability 1 — a. Let X be the number of sets of three deficient rows. Let 
A = {(0,1,1) U (1, 0,1) U (1,1, 0) U (1,1,1)}. Then, 

m < =Cin)^ P(X = 0) > 0, 

V 36 \Pn\^)) 

where Pn{,Oi) = (1 — + 3(1 — q;^(1 — a))"', so that the largest number of 

rows in aC(n,2,3, A,l) covering array, with each row having expected weight 
k = a ■ n, is at least C(^)- 

Proof We use the Lovasz local lemma. Let Ai be the event that the ith set 
of 3 rows is dehcient, so that we have, for each i, 

P(Aj) = P I {j is missing} 

\j&A 

< P((l, 1,1) is missing) + 3P((1,1, 0) is missing) 

= (1-0^)’^ +3(1-02(1-0))’^. 


Now, it is evident that the dependence number d satishes d + 1 < 3 (*"2 < 
3m^f 2, so that P(X = 0) > 0 provided that 

2 / \ 

—m pn{a) < 1, 


i.e., if 


as asserted. 


m < 


(pn(a)) 




Remarks. Notice that 


Pn{a) 


4(1)" ifo = | 

3(1 — 0^(1 — o))’’(l + 0 ( 1 )) if o > I 
(1 — o^)’’(l + 0 ( 1 )) if o < |. 


□ 


It follows, due to the fact that (1 — o^)” is monotone decreasing and 3(1 — 
0^(1 — o))" is decreasing on the interval [1/2, 2/3], that the function Pn(o) 
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attains its minimum at a = 2/3 - yielding the conclusion that the maximum 
size of a C(n, 2, 3, A, 1) covering array appears to be when the expected weight 
of each row is | ■ n. Also, a lower bound on this size is 

■ (1.0834...)”. 

Furthermore, the rate of growth of the lower bound is exponential no matter 
what a > 0 is. It is also worth mentioning that nowhere in this paper can 
one obtain improved asymptotic results by using a more careful estimation 
of the probability of sets of rows being dehcient, using, for example, the 
inclusion-exclusion principle. 

Might we have been able to incorporate into our analysis the rather re¬ 
alistic case where a = an ^ oo]an/n 0? Since in this case pn‘^{a) ~ 
(1 — ~ exp{—nQ;)^/2}, we need, at the very least, to have oo 

for the size of the array to grow to inhnity with n. But the results are entirely 
satisfactory in this case. For example, with n = 10® and = log^'^^ = 

.0075, we get a lower bound of ~ 2 ■ 10®^ on the size of the array. 

Next we turn to our main result, which, while being similar in spirit to 
Theorem 2, is based on the GEKR property holding for fixed weight rows. 
The analysis gets more complicated due to the fact that entries within each 
row are no longer independent, but the dependence structure between rows 
stays the same. We shall see, moreover, that a better bound is obtained in 
the hxed weight case unless both a and n are small. 

Theorem 3 Consider an m x n array of zeros and ones with each row con¬ 
taining k = an ones and n — k = {1 — a)n zeros (a G Q fl (0,1); an G 
so that each of the (^) configurations of ones and zeros are egually likely. Let 
X he the number of sets of three deficient rows. Let A = {(0,1,1) U (1, 0,1) U 
(1,1, 0) U (1,1,1)}. Then there exists K > 0 such that 

m < ^- ) (1 + o(l)) := nn(a) => P(X = 0) > 0, 

“ \VKp{a)-CnC*J ^ ^ ^ ^ ^ ’ 

where p,{a) is given on the last line of the proof that follows. In other words, 
the largest number of rows in a C{n, 2, 3, A, 1) covering array is at least z/„. 

Proof We set X = before, X is the number of dehcient sets of 

three rows. As before, the d in the Lovasz local lemma can be bounded by 



5 



?)m?/2, and the main problem is to estimate the probability that a given set 
of rows is deficient. We have 


P 


F{Ij = 1) < IP((1,1, l)is missing) + 3P((1,1, 0)is missing) 

/ n \ /an\ /n—an\ /n—u\ f \ (n—(yn\ / n—u \ 

\an) V u ) \an-u) V an ) ^ ^ V a.n) \ u ) \a.n—u) \n—an) 


E 


(") 

\an/ 

K n—r\ fn—u\ 
r—uJ \ r / 




+ 3E 


/r\ fn—r\ /n—u\ 
\uJ \r—u) \n—r/ 

0 " 


(”)■ 

\an/ 


{an = r) 


= ^4>{u)+ ^'tl){u) (say). 

1 2 

Consider Si hrst. We must have, in this first sum, 

u < minjan, (1 — a)n} = min{r, n — r}; 

u > max{0, {2a — l)n} = max{0, 2r — n}. 
With this in mind, consider the ratio 


•#>(“) C)C:;)(”r) 


of successive terms, which may be checked to exceed one if and only if 

(n — r + 2)u^ + (r^ — 2r — — n + l)u + {nr‘^ — + 2nr — n) >0. 

The idea is to hnd which value of u maximizes 4>{u). Towards this end, we 
see that 0 is increasing if and only if n < ni;M > ^ 2 , where the smaller and 
larger roots ui,U 2 of the above quadratic are respectively given by 

= 2{n - r + 2) ' ( “ ^ 1 , 

and ^ 

“2 = 7T7--TtT ■ ( - + 2r + + n - 1 + \^'y{n,r )), 

2(n — r + 2) V / 

with 

7 (n, r) = 1 — 2nr^ — 16nr — 4r + 6n + 6r^ + lln^ 

+4r^ — 6r^n^ — 3r^ + + 6n^ — 8rn^ + 8r^n. 
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We simplify by reintroducing the parameter a; an = r, and setting 


e = 1 — 3a‘^ — 6a^ + 8q;^; 


/ = —2a‘^ + 4q;^ + 6 — So:; 

e and / are, respectively, the coefficients of the fourth and third order terms 
in the radical above. We thus get, noting that e > 0 for a < 1, 


“■ = (2n - Ln + 4) ' ^ 

and 

= (2n-2L + 4) ■ ^ + ®<^)) ■ 

The critical points we need to investigate, together with the endpoints of the 
summation, are thus (within 0(1) of) 


Ml 

, approx 


(1 - - y/e) 

-—^^-• n 

(2 - 2a) 

(1 — a^ — Vl — 3 q;^ — 6q;2 + Sq;^) 
(2 - 2a) 

P ■ n = P{a)n 


and 


(1 — a^ + Je) (1 — a^ + Vl — So;^ — + Sq;^) 

Mo aDDrox = -;-^- ' n = -;-r- ■ U. 

2,approx 

Consider Figure 1 below which graphs respectively 1/n times (i) the lower 
limit of summation of Si, given by the straight line; (ii) the upper limit of 
summation of Si, given by the triangular plot; and (hi) Mi^approx, given by the 
increasing curve; and (iv) M 2 ,approx, the unimodal curve. Note the interesting 
(but obvious in hindsight) fact that the lower limit of summation exceeds the 
upper limit if a > 2/3 and thus IP((1,1,1) is missing) equals zero in this case. 
Thus the maximum of 0(m) is attained at mi, approx + 0(1) for each a < 2/3. 
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Consider S 2 , in which the limits of summation are 

0 V {2a — l)n <u< an. 

Now we hnd that 

%Ij{u + 1 ) {an — u)^ 

'ip{u) {u + l){n — 2an + u + l){n ■ 
if and only if au^ + bu + c > 0, where 



b = —Sa^Ti^ — 71^ + 2an^ — n — 2an + 1 ; 

and 

c = + 2an‘^ — n. 

The roots vi and V 2 of the qnadratic au^ + 6m + c = 0 are 

—-— (sa^ri^ + — 2an^ + n + 2an — 1 ± \/5{n^ a)^ 

^{oiTi ~\~ 2j \ / 

where 

5{n,a) = 1 — Aan — 20^171^ — Aav? + + 7 v?'+ Qn + 

+ lOa^n^ — 12Q;^n"^ — lOa^n^ — Aan^ + 2n^ + + Aan^. 



V2 5 approx 
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Figure 2 

The functions ui^approx/''^;'y 2 ,approx/''^;« and (2q; — 1) are plotted in Figure 2. 
These graphs reveal that the maximum of 'ip{u) occurs at ui,approx + 0(1)- 
Returning to Si, we estimate as follows; throughout this paper K will 
denote a generic constant whose value might change from line to line. 

< iFn-max0(u) 

1 

^ Kn • 0(rti^approx T 
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< 


Kn 


/an\ / n—an \ /n—0n\ 
\(3n/ \an—f3n/ \ an ) 



Stirling’s approximation N\ ^ \/2t^N (iV/e)^, applied to each of the binomial 
coefficients above, yields after some simplihcation, 




< Kn(p{(3n) 



= K^/n^'^ say. 


^)2a-2/3(l -2a + /3)i-2«+/3(l - a - 


n 


We treat S 2 in a similar fashion: For each a G [0,1], 


< Kn ■ m.sj^'ip{u) 

2 

< Kn- approx) 

fan\ / n—an \ fn—Kn\ 

_ XKn)\an—Kn)\n—an) 

r? ■ 

\an/ 

Stirling’s approximation again yields after some simplihcation, 




< Kn(j){Kn) 



Q,3a(i _ q,^2 K 

K'^{a - -2a + k ) i - 2 «+« 

say. 


n 


In Figure 3 we have plotted .^(o;) (the concave down curve) and 0(a); note 
that the domain of ^ is [0,2/3], while 0 is dehned on [0,1]. It follows that 
the El sum dominates the probability of three specihc rows being dehcient 
if a < 1/2, with S 2 taking over for a > 1/2 just as in the case of rows 
with expected weight k. There is a signihcant difference in the two results, 
however. Figure 4, which plots 0 in the vicinity of a = 0.74, reveals that 
the minimum of 0 is attained around a = 0.7395 in contrast to the fact that 
the minimum was attained at a = 2/3 in the “independent” case. Thus our 
proof suggests that the maximal size of a generalized EKR array occurs when 
a ^ 0.7395. 
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We now finish off the proof of Theorem 3. By the Lovasz local lemma, 
P(X = 0) > 0 whenever ?>epw?/2 < 1, or if 

+ <1, (a<l/2); 

{l + o{l))K^e^m^ <1, (a >1/2). 


We now simply set fj, = ^ if a < 1/2 and fi = 9 if a > 1/2. Theorem 3 
follows. □ 



A Remark and Numerical Values. First note that we recover the result 
of Roux, as discussed in jS] , on setting a = 1/2 (this is the highly symmetrical 
case studied earlier in the literature.) In this case we get 

_3-y5^ _ 3-^5 

^1,approx ^1,approx ^ p K ^ . 

Second, we provide below in Tables 1 and 2 some numerical values for various 
values of a, for the independent and hxed weight models studied in Theorems 
2 and 3 respectively. While computing the bounds given by Theorem 3, we 
have ignored the effect of the (1 + o(l)) term, and pretended that K = 1. 
The “interesting” first choices for a are each roughly 1/6, correspond to 


= 10, 000, and yield m 

6.5 ■ 10® - the world’s current population. 

Table 1 

The Independent Model 

a/n 

10,000 

100,000 

300,000 

1,000,000 

0.1669 

6.51 ■ 10^ 

7.66 ■ 10809 

1.83 ■ 10808 

3.88 ■ 108011 

0.2 

1.37-101^ 

1.29 ■ 10874 

8.79 ■ 10822 

7.22 ■ 108743 

1/3 

4.34 ■ 10®^ 

1.64 ■ 10889 

1.81 • 102458 

8.00 ■ 108894 

0.5 

2.26 ■ 10289 

9.80 ■ 102898 

1.53 ■ 108098 

2.33 ■ 1028995 

2/3 

4.32 ■ 10847 

1.79 ■ 108481 

7.00 ■ 1080444 

2.63 ■ 1034817 

0.7395 

1.50 ■ 10883 

4.61 ■ 108836 

1.20 ■ 1080011 

3.37 ■ 1083371 

0.8 

7.47 ■ 10296 

4.28 ■ 102973 

9.63 ■ 108921 

1.64 ■ 1029741 




Table 2 





The Fixed Weight Model 


a/n 

10,000 

100,000 

300,000 

1,000,000 

0.1685 

6.50 ■ 109 

7.61 ■ 10806 

1.06 ■ 10823 

6.52 ■ 108079 

0.2 

2.32 ■ 1087 

2.57-10882 

4.08 ■ 10849 

1.26 ■ 108835 

1/3 

9.50 ■ 1092 

3.39 ■ 10938 

9.36 ■ 102817 

1.99 ■ 109396 

0.5 

9.00 ■ 10896 

1.97- 108978 

1.84 ■ 1081937 

8.82 ■ 1089793 

2/3 

4.50 ■ 10880 

1.93 ■ 108315 

1.73 ■ 1088948 

7.27 ■ 1083163 

0.7395 

3.28 ■ 10848 

8.27 ■ 108493 

1.35 ■ 1086484 

1.46 ■ 1084950 

0.8 

1.74 ■ 10833 

1.48 ■ 108341 

7.86 ■ 1086025 

5.19 ■ 1083422 
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3 Open Questions 

Of possible interest might be lower bounds on the size of partial t-covering 
arrays C{n,q,t, A, X) for general values of the parameters. More crucial, 
however, would be construction, algorithms, and improvements - especially 
for small values of n - for the baseline case studied in this paper; t = 3, q = 
2; A = 1, and A as specihed by the GEKR property. Also, we feel that the 
situation where a = an varies as a function of n so that an/n 0 needs 
further investigation. Last but not least, Karen Meagher has asked a deep 
question; Can the methods in this paper be used to try to get a generalized 
Sperner property? Perhaps it might be possible to use a generalized Sperner 
type result to say that the largest system (not necessarily of k-sets) that 
satishes the generalized EKR property is a system of k-sets. 
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